WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




per 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 5 : 
C12Q 1/70, G01N 33/53 
C12N7/00, 15/00, 1/20 
G01N 33/566 



Al 



(11) International Publication Number: WO 91/17271 

(43) International Publication Date: 14 November 1991 (14.1 1.91) 



(21) International Application Number: 

(22) International Filing Date: 



PCT/US9 1/02989 
1 May 1991 (01.05.91) 



(30) Priority data: 

517,659 



1 May 1990(01.05.90) 



US 



(60) Parent Application or Grant 

(63) Related by Continuation 
US 

Filed on 



517,659 (CIP) 
1 May 1990 (01.05.90) 



(71) Applicant (for all designated States except US): AFFYMAX 

TECHNOLOGIES N.V. [NL/NLJ; De Ruyderkade 62, 
Curacao (AN). 

(72) Inventors ; and 

(75) Inventors/Applicants (for US only) : DOWER, William, J. 
[US/US]; 761 Partridge Avenue, Menlo Park, CA 94025 
(US). CWIRLA, Steven, E. [US/US]; 765 San Antonio 
Road, #64, Palo Alto, CA 94303 (US). 



(74) Agent: SMITH, William, M.; Townsend and Townsend, 
One Market Plaza, 2000 Steuart Tower, San Francisco, 
CA 94105 (US). 



(81) Designated States: AT, AT (European patent), AU, BB, BE 
(European patent), BF (OAPI patent), BG, BJ (OAPI 
patent), BR, CA, CF (OAPI patent), CG (OAPI patent), 
CH, CH (European patent), CI (OAPI patent), CM 
(OAPI patent), DE, DE (European patent), DK, DK 
(European patent), ES, ES (European patent), FI, FR 
(European patent), GA (OAPI patent), GB, GB (Euro- 
pean patent), GR (European patent), HU, IT (European 
patent), JP, KP, KR, LK, LU, LU (European patent), 
MC, MG, ML (OAPI patent), MR (OAPI patent), MW, 
NL, NL (European patent), NO, PL, RO, SD, SE, SE 
(European patent), SN (OAPI patent), SU, TD (OAPI 
patent), TG (OAPI patent), US. 



Published 

With international search report. 



(54) Title: RECOMBINANT LIBRARY SCREENING METHODS 



(57) Abstract 

Nucleotide sequences encoding proteins of interest are isolated from DNA libraries using bacteriophage to link the protein 
to the sequence which encodes it. DNA libraries are prepared from cells encoding the protein of interest and inserted into or ad- 
jacent to a coat protein of a bacteriophage vector, or into a sequence encoding a protein which may be linked by means of a li- 
gand to a phage coat protein. By employing affinity purification techniques the phage particles containing sequences encoding 
the desired protein may be selected and the desired nucleotide sequences obtained therefrom. Thus, for example, novel proteins 
such as monoclonal antibodies may be produced and conventional hybridoma technology avoided. 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Austria 


ES 


Spain 


MG 


Madagascar 


AU 


Australia 


FI 


Finland 


ML 


Mali 


BB 


Barbados 


FR 


France 


MN 


Mongolia 


BE 


Belgium 


GA 


Gabon 


MR 


Mauritania 


BP 


Burkina Faso 


GB 


United Kingdom 


MW 


Malawi 


BG 


Bulgaria 


GN 


Guinea 


NL 


Netherlands 


BJ 


Benin 


GR 


Greece 


NO 


Norway 


BR 


Brazil 


HU 


Hungary 


PL 


Poland 


CA 


Canada 


IT 


Italy 


RO 


Romania 


CF 


Central African Republic 


JP 


Japan 


SD 


Sudan 


CG 


Congo 


KP 


Democratic People's Republic 


SE 


Sweden 


CH 


Switzerland 




or Korea 


SN 


Senegal 


CI 


Cote d*I voire 


KR 


Republic or Korea 


SU 


Soviet Union 


CM 


Cameroon 


LI 


Liechtenstein 


TD 


Chad 


CS 


Czechoslovakia 


LK 


Sri Lanka 


TG 


Togo 


DE 


Germany 


LU 


Luxembourg 


US 


United Slates of America 


DK 


Denmark 


MC 


Monaco 







WO 91/17271 



PCI7US91/02989 



1 



5 RECOMBINANT LIBRARY SCREENING METHODS 

Field of the Invention 
The present invention relates generally to 
recombinant DNA technology and, more particularly, to 
10 methods for screening DNA libraries for DNA sequences 

that encode proteins of interest. 



Background of the Invention 

15 Isolating a gene which encodes a desired 

protein from a recombinant DNA library can be a daunting 
task. Hybridization probes may facilitate the process, 
but their use is generally dependent on knowing at least 
a portion of the sequence of the gene which encodes the 

20 protein. When the sequence is not known, DNA libraries 

have been expressed in an expression vector and 
antibodies have been used to screen plaques or colonies 
for the desired protein antigen. This procedure has been 
useful in screening small libraries, but sequences which 

25 are represented in less than about 1 in 10 5 clones are 

easily missed, and screening libraries larger than 10 6 
can be difficult. 

Antibody molecules are comprised of light and 
heavy polypeptide chains, each having a distinct variable 

3 0 (V) region, the combination of which produces an antigen 

binding region. Based on random combination events of 

heavy and light chains in any one antibody-producing 

cell, the potential repertoire of antibody heavy and 

12 

light chain combinations may be as much as 10 or 
3 5 greater. Thus, to sample a large fraction of this 

repertoire and obtain clones which express an antibody 
having a desired antigen binding specificity, an 
extremely large library may have to be constructed and 
screened. 
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Methods are needed which facilitate the 
screening process, thereby enabling DNA sequences which 
encode proteins of interest, and particularly antibody 
molecules, to be more readily identified, recloned and < 
5 expressed. Were such procedures available, it may become 

possible to probe an animal's entire antibody repertoire, % 
for example, to obtained an antibody to a preselected 
target molecule. In this manner the difficulties and 
labor intensive process of generating monoclonal 
10 antibodies, regardless of the species of origin, by 

conventional hybridization or transformation of 
lymphoblastoid cells, may be avoided. Quite 
surprisingly, the present invention fulfills these and 
other related needs. 



Summary of the Invention 
Methods are provided for screening a DNA 
library for a nucleotide sequence which encodes a protein 

20 of interest. The methods generally involve physically 

linking the protein of interest, in a biologically active 
form (usually having a binding activity) , to the specific 
DNA sequence encoding that protein. This allows the 
isolation and identification of that DNA by means of 

25 affinity techniques relying on the binding activity of 

the protein of interest. 

In one aspect of the invention a bacterial host 
cell is transformed or infected with a bacteriophage 
expression vector, which vector comprises a DNA library 

30 member joined to a first nucleotide sequence encoding a 

tag protein. The vector also contains a second 
nucleotide sequence that encodes a tag ligand peptide 
that specifically binds the tag protein, which second 
sequence is joined to a third nucleotide sequence 

35 encoding a coat protein of the bacteriophage particle. 

The transformed or infected host cell is then cultivated 
under conditions suitable for expression and assembly of 
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the bacteriophage particles and the association of the 
tag protein with the tag ligand peptide on the surface of 
the phage particle* Particles which encode the protein 
of interest are then selected from the culture by an 
5 affinity enrichment technique • This is accomplished by 

means of a ligand specific for the protein of interest, 
such as an antigen if the protein of interest is an 
antibody. Repeating the affinity selection procedure 
provides an enrichment of clones encoding the desired 

10 sequences, which may then be isolated for sequencing, 

further cloning and/or expression. 

In another embodiment the invention concerns 
methods for screening a DNA library expressing one or 
more polypeptide chains that are processed, folded and 

15 assembled in the periplasmic space to achieve hi logical 

activity. A non-lytic phage vector is preferred for this 
purpose. Particularly preferred examples of such vectors 
are the filamentous phage fd, fl and M13. In this 
embodiment a library of DNA sequence members, each joined 

20 to a first nucleotide sequence coding for a tag protein, 

is cloned into an appropriate location of the phage 
genome, behind an appropriate promoter and translation 
sequences and a sequence encoding a signal peptide leader 
directing transport of the downstream fusion protein to 

25 the periplasmic space. The phage vector also contains a 

second DNA sequence inserted into a coat protein gene to 
express a tag ligand peptide acting as a ligand for the 
tag protein, which peptide is expressed in a location of 
the coat protein exposed to the external environment of 

3 0 the phage and, so, is accessible for binding by the tag 

protein. In a preferred embodiment this peptide is 
located at or near the N-terminus of the pill coat 
protein. The protein (s) of interest are expressed and 
transported to the periplasmic space, and the properly 

3 5 assembled proteins are adsorbed to the phage particle by 

virtue of the interaction of the tag protein with the 
ligand peptides on the phage as the phage particles are 
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extruded from the cell. Phage bearing the desired 
protein are then selected by means of a ligand specific 
for the protein of interest. 

In yet another embodiment the invention * 
5 concerns methods for screening a DNA library whose 

members are joined to a nucleotide sequence encoding a 
coat protein of a bacteriophage vector. The phage may be 
a filamentous phage, such as, for example, fd, fl, or 
M13. Typically the DNA library sequences are inserted in 

10 the 5 1 region of a gene which encodes a phage coat 

protein, such as pill. Thus, a phage coat protein fused 
to a DNA library member-encoded protein is produced and 
assembled in to the viral particle. For those proteins 
whose activity requires more than one chain, the second 

15 and any subsequent chain (s) are expressed from the phage 

genome so as to be transported to the periplasm where 
they assemble with the first chain that is fused to the 
phage coat protein, which complex associates with the 
phage particle as it exits the cell. Phage particles 

20 which encode the protein of interest are selected by 

means of a ligand specific for the protein. 

Description of the Specific Embodiments 
Methods and compositions are provided by the 
25 present invention for conveniently identifying clones of 

desired recombinantly-produced proteins. A method of 
affinity enrichment allows for the screening of libraries 
to identify clones having desired ligand specificities, 

q 

where up to about 10 or more clones may be readily 
30 screened. This represents a significant improvement over 

the art, where conventional procedures typically allow 
about 10 6 clones in a DNA library to be screened, i 

7 

sometxmes up to 10 clones but with proportionate 
increases in time and labor. , 
35 In one aspect the invention involves linking 

the desired protein, such as an antibody molecule, to the 
DNA which encodes it. By then enriching for the protein, 
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such as by affinity techniques, for example, the DNA 
which encodes the protein is also enriched and may then 
be isolated • The DNA so obtained may then be cloned and 
expressed in other systems, yielding potentially large 
5 quantities of the desired protein, or may be subjected to 

sequencing and further cloning and genetic manipulations 
prior to expression. 

The protein for which the DNA is enriched and 
cloned according to the present invention is typically an 

10 antibody or fragment thereof, but may also be any protein 

which may be cloned from a nucleotide library. In 
addition to antibodies, such proteins may include, for 
example, growth hormones, interferons, interleukins, 
hormones, enzymes, zymogens, etc. Proteins which may be 

15 cloned are those for which specific binding partners 

(e.g. antigen or hapten when the desired protein is an 
antibody) have been identified. 

When the protein of interest is an antibody of 
a desired binding specificity, the antibody may be of any 

2 0 of the known isotypes or subclasses for a particular 

species, and may be a single-chain or two-chain binding 
complex or portion thereof. For instance, only the 
variable antigen-binding regions of heavy (V R ) and/or 
light (V L ) chains may be identified and cloned; the 

25 binding fragments (F y ) or Fab encoded thereby may find 

use either as a binding fragment, joined to constant 
regions of heavy or light chains, or joined to other 
proteins having desired effector functions. The 
characteristics of the constant region domains will 

30 depend to a large extent on the use intended for the 

antibody, e.g., diagnostic and/or therapeutic 
applications, catalytic antibodies, etc. 

Typically the protein's binding partner, e.g., 
an antigen or hapten when the protein is an antibody, is 

35 known, and the methods herein provide a means for 

creating and/or identifying a protein (and the DNA which 
encodes the protein) which specifically binds the binding 
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partner of interest. Thus when the protein is an 
antibody the present invention provides a novel means for 
producing antibodies, particularly monoclonal antibodies, 
to predetermined antigens and antigenic determinants, 
5 thereby circumventing the laborious, time-consuming and 

often unpredictable process of conventional monoclonal 
antibody technology. Although murine monoclonal antibody 
production is often relatively straightforward, it is 
labor intensive. Furthermore, the development of human 
10 monoclonal antibodies by conventional approaches has been 

hampered by a variety of technical difficulties which, to 
a large extent, would be circumvented by the present 
invention. 

According to the present invention, a DNA 
15 library is prepared from cells which are capable of 

encoding the desired protein. A variety of techniques 
exist for preparing the library, which may be prepared 
from either genomic DNA or cDNA. See, e.g., Sambrook et 
al., Molecular Cloning. A Laboratory Manual. 2nd ed. , 

2 0 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 

N.Y., 1989, which is incorporated herein by reference. 
It is understood that when referring herein to DNA it is 
meant to include both genomic and cDNA, unless otherwise 
specified. The cells which serve as the source of RNA or 
25 DNA may be any which are capable of encoding the protein 

of interest. Enrichment procedures and means for 
amplifying the regions containing the gene(s) , if known, 
may be employed. For instance, when the desired protein 
is an antibody, RNA and cDNA may be prepared from spleen 

3 0 cells from unimmunized animals, from animals immunized 

with antigens or haptens of interest, hybridoma cells, or 
lymphoblastoid cells, for example. The use of spleen 
cells from unimmunized animals provides a better 
representation of the possible antibody repertoire, while 
35 spleen cells from immunized animals are enriched for 

sequences directed against epitopes of the immunizing 
antigen or haptens. The cells may be obtained from a 



WO 91/17271 



PCT/US91/02989 



variety of animal species, such as human, mouse, rat, 
lagomorpha, equine, bovine, avian, etc., the selection 
often dependent on the protein of interest and the use 
for which it is intended. 
5 Amplification of sequences representing 

messenger RNA (mRNA) isolated from cells of interest, 
such as spleen or hybridoma cells, may be performed 
according to protocols outlined in, e.g., U.S. 4,683,202, 
Orlandi, et al. Proc. Natl. Acad. Sci. USA 86:3833-3837 

10 (1989), Sastry et al., Proc. Natl. Acad. Sci. USA 

86:5728-5732 (1989), and Huse et al. Science 246:1275- 
1281 (1989) , each incorporated herein by reference. 
Oligonucleotide primers useful in amplification protocols 
may be unique or degenerate or incorporate inosine at 

15 degenerate positions. Thus, for multi-chain 

immunoglobulins, primers would be generally used for 
amplification of sequences encoding the variable regions 
of both the heavy and light chains. Restriction 
endonuclease recognition sequences may be incorporated 

20 into the primers to allow for the cloning of the 

amplified fragment into a vector in a predetermined 
reading frame for expression. 

Expression libraries containing the amplified 
cDNA are typically prepared in a vector such as a 

25 bacteriophage. The characteristics of the suitable 

bacteriophage depends on the specific embodiment 
employed, and will generally be those which conveniently 
allow insertion of the recombinant DNA into host cells by 
in vitro packaging or genetic transformation, which 

3 0 infect host cells capable of expressing the desired 

proteins, and whose DNA contains restriction sites, 
useful for cloning, which are located in regions of the 
phage genome where insertion of foreign DNA will not 
substantially disrupt essential functions of the phage. 

35 Thus, in one embodiment of the invention 

whereby the protein is processed and assembled into a 
functional form and associates with the bacteriophage 
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particles within the cytoplasm of the host cell, 
bacteriophage A and derivatives thereof are examples of 
suitable vectors. When used for expression of antibody 
sequences, such as V H , V L , F v (variable region fragment) 
5 or Fab, library DNA may be inserted into A vector, as 

described generally by Huse et al., supra, and Short et 
al., Nucleic Acids Res. 16:7583 (1988). The vector may 
be designed to be asymmetric with respect to restriction 
sites that flank the cloning and expression sequences. 
10 This asymmetry allows efficient recombination of 

libraries coding for separate chains of the active 
protein. 

When the desired protein is an immunoglobulin, 
a library expressing antibody light chain binding regions 

15 may be combined with one expressing antibody heavy chain 

binding regions, thereby constructing combinatorial 
antibody or Fab expression libraries. For instance, one 
A vector is designed to serve as a cloning vector for 
antibody light chain sequences, and another A vector is 

20 designed to serve as a cloning vector for antibody heavy 

chain sequences in the initial steps of library 
construction. A combinatorial library is constructed 
from the two A libraries by crossing them at an 
appropriate restriction site. DNA is first purified from 

25 each library, and the right and left arms of each 

respective A vector cleaved so as to leave the antibody 
chain sequences intact. The DNAs are then mixed and 
ligated, and only clones that result from proper assembly 
of reciprocal vectors reconstitute as viable phage. 

3 0 The invention also contemplates alternative 

methods for achieving efficient recombination of 
libraries encoding the separate protein chains. These 
alternatives include the use of multiple, rare cloning 
sites (e.g., Not I and Sfi I) used in subcloning 

35 sequences from one library into the other; or using phage 

vectors which exist in double stranded, plasmid-like form 
at some point in their life cycle (e.g., replicative form 
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"RF" DNA of filamentous phage) , and to clone the separate 
libraries in different versions of the vector containing 
different cloning sites and different selectable markers 
(Amp R and Tet R , for example) . The sequences from the 
5 separate libraries are joined so that a sequence of 

interest and a corresponding selectable marker from each 
library resides in each single phage genome. Application 
of double selection will enrich for those hosts harboring 
the combined phage genomes. Alternatively, in accordance 

10 with the present invention antibody light and heavy 

chains can be expressed from separate plasmids. 

In other embodiments the V region domains of 
heavy and light chains can be expressed on the same 
polypeptide, joined by a flexible linker to form a 

15 single-chain Fv fragment, which gene sequences can then 

be cloned into the desired vector. As generally 
described in McCafferty et al., Nature 348:552-554 (Dec. 
6, 1990), complete antibody V domains of an anti-lysozyme 
antibody, joined by a flexible (Gly 4 -Ser) 3 linker and 

20 cloned into an fd phage vector fdCATl at the N-terminal 

region of the gene III protein, produce an antibody-gene 
III fusion which is detectable in the recombinant phage. 
The recombinant phage can be affinity-purified over a 
lysozyme-Sepharose® column and subsequently further 

25 enriched by propagating the isolated phage. 

In yet another approach, one or both of the 
antibody light and heavy chains (or the V regions 
thereof) can be expressed by a phagemid, that is, a 
vector which combines features of plasmids, such as a 

3 0 bacterial ColEl origin of replication, and filamentous 

bacteriophages, such as the major intergenic region. 
Phagemids are discussed genrally in Sambrooke et al., 
supra . at pages 4.17-4.19 and 4.44-4.50, which are 
incorporated herein by reference. The vector can be 

35 maintained as a small plasmid vector by selection, or the 

plasmid can be efficiently packaged into virions , or 
phagemid particles, by infection with helper phage, such 
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as M13K07. As described generally in Bass et al., 
Proteins 8:309-314 (1990) , controlling the copy number of 
gene III fusion proteins which are produced in a mixture 
of both wild-type gene III and a gene III fusion protein 
5 on phagemid particles permits proper folding of a fusion 

protein while allowing efficient panning of large epitope 
libraries. Thus, it is possible to sort phagemid 
particles whose peptides are of relatively high affinity, 
with binding constants in the micromolar to nanomolar 
10 range. 

As mentioned, the vectors may be engineered to 
efficiently clone the library amplification products. 
For example, oligonucleotides may be used to introduce 
the asymmetric restriction sites, a ribosome binding site 
15 at an optimal distance for expression of the cloned 

sequence, and cloning sites for the library amplification 
products . 

In certain embodiments of the invention at 
least one of the library-encoded protein chains is cloned 

20 into a vector so as to be expressed fused to a tag 

protein. As used herein, a "tag" protein is meant to 
refer to a protein which has a specific binding affinity 
for a peptide or protein ligand. For example, the tag 
protein should have specificity for a peptide or protein 

25 ligand of at least about 3 to about 100 or more amino 

acids, preferably with an affinity exceeding at least 
about 10" 7 to 10" 8 M, and more preferably equal to or 
greater than about 10~ 9 M. A tag protein with more than 
one binding site for the tag ligand peptide provides a 

30 resulting increase in avidity, which greatly decreases 

the dissociation rate and somewhat relaxes the 
requirement for affinity at each site (the affinity of 
each binding site may then be as low as about 10~ 6 M) . 
The tag protein should be preferably less than about 

35 lOOkD but may be more, and should be of a composition 

such that its fusion with the protein of interest, e.g., 
the N- terminus of the tag with the C- terminus of an 
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antibody heavy chain fragment, does not interfere with 
the binding of tag protein to the tag ligand peptide. 
The tag protein should also be such that its presence on 
the C-terminus of the protein (e.g., antibody fragment) 
5 does not in general alter the specificity of, or 

seriously reduce the affinity of the protein for its 
ligand (e.g., of an antibody for its antigen). 

A sequence encoding the tag peptide ligand is 
engineered into or adjacent to a gene encoding a 

10 bacteriophage coat protein so as to display the ligand on 

the outer surface of the phage particle. Thus, when the 
foreign library DNA is expressed as a fusion protein with 
the tag protein, the specific binding pair ligand member 
will be accessible to bind the tag/desired protein fusion 

15 protein. If the desired library encoded protein is a 

multi-chain protein, such as an antibody, generally only 
one of the chains (preferably the heavy chain if an 
antibody) is expressed fused to the tag protein. To 
minimize interference with the antigen binding region of 

20 the antibody or fragment thereof (which binding is used 

in the panning procedure described further below to 
isolate the phage encoding the desired antibody or other 
protein of interest) the tag protein is typically fused 
at the heavy chain C-terminus or the C-terminus of the 

25 desired fragment. 

The density of tagging of the phage particles 
may be controlled by expressing the ligand peptide in a 
plasmid carrying an additional copy of the phage protein 
expressed at the desired level. In this case only a 

30 portion of the proteins going into the assembly of the 

phage would contain the peptide ligand to the tag 
protein. 

Host cells are then infected with the phage, 
and cultivated under conditions allowing for the 
3 5 expression and assembly of phage particles. The 

appropriate host cells for bacteriophage A are various 
strains of E. coli, specific examples depending on which 
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of the several suitable A vectors is chosen. Of course, 
phage having bacterial hosts other than E. coli may also 
be used. 

Under the appropriate induction, the library- 
5 encoded protein chain-tag (and other library- encoded 

chains if the protein is multi-chained) is expressed and 
allowed to assemble in the bacterial cytoplasm. It 
should be noted that the assembly of some multi-chain 
proteins, such as some antibodies, may be hindered to 

10 some extent in the bacterial cytoplasm. The induction of 

the protein (s) may be delayed until some replication of 
the phage genome, synthesis of some of the phage 
structural proteins, and assembly of some phage particles 
has occurred. The assembled protein chains then interact 

15 with the phage particles via the binding of the tag 

protein with its peptide ligand on the outer surface of 
the phage particle. The cells are lysed and the phage 
bearing the library-encoded receptor protein (that 
corresponds to the specific library sequences carried in 

2 0 the DNA of that phage) are released and isolated from the 

bacterial debris. Once the cells lyse, any further 
binding of the antibodies to the phage should be 
prevented to minimize spurious binding between receptor 
protein and phage particles originating from different 
25 cells. Because the ligand peptide on the phage surface 

may be in excess, an amount of the tag protein is added 
to the cell suspension before lysis sufficient to bind 
and block unoccupied peptide on the surface of phage 
released from the cells. 

3 0 To enrich for and isolate phage which contain 

cloned library sequences that encode a desired protein, 
and thus to ultimately isolate the nucleic acid sequences 
themselves, phage harvested from the bacterial debris are 
affinity purified. A ligand or binding partner specific 
35 for the desired cloned library protein is used in the 

affinity purification. For example, when the desired 
protein is an antibody which specifically binds a 
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particular antigen or antigenic determinant, the antigen 
or determinant is used to retrieve phage having the 
desired protein on or a part of its outer surface. The 
ligand is typically adsorbed to an insoluble substrate , 
5 such as a particle or bead or plate. The phage so 

obtained may then be amplified by infecting into host 
cells. Additional rounds of affinity enrichment, as 
great as or greater than 10 4 fold per round, and 
amplification may be employed until the desired level of 

10 enrichment is reached or the target phage are no longer 

enriched relative to the background phage. 

The enriched antibody-phage are also screened 
with additional detection techniques such as expression 
plaque (or colony) lift (see, e.g., Young and Davis, 

15 Science , 222:778-782 (1983), incorporated herein by 

reference) whereby the same or another binding partner is 
used as a probe. Screening may employ additional assays 
(for a catalytic activity, for example) which are used to 
detect, in situ, plaques expressing proteins having the 

20 desired characteristics. The phage obtained from the 

screening protocol are infected into cells, propagated, 
and the phage DNA isolated and sequenced, and/or recloned 
into a vector intended for gene expression in prokaryotes 
or eukaryotes to obtain larger amounts of the particular 

25 protein selected. 

In another embodiment, the library-encoded 
desired protein (or multiple chains comprising said 
protein) is transported to an extra-cytoplasmic 
compartment of the host cell, usually the periplasmic 

3 0 space, to facilitate processing and/or proper assembly. 

This may be preferred when the desired protein is an 
antibody or the antigen binding region of antibody heavy 
and light chains. When extra-cytoplasmic transport of 
the desired protein is employed, the sequences encoding 

35 one or more polypeptide chains of the protein are cloned 

adjacent to appropriate transcriptional and translational 
signals and signal peptide leaders that will direct the 



WO 91/17271 



PCT/US91/02989 



14 

mature chains to the periplasm. As above, however , at 
least one of the chains is cloned adjacent to or within a 
tag protein so as to be expressed as a fusion of the 
protein of interest and the tag protein. In the case of 
5 antibody chain fragments, the tag is preferably fused to 

the C-terminus of the antibody chain (usually the heavy 
chain, but alternatively the light chain) to minimize 
interference with the antigen binding region. For other 
proteins the tag may be placed in an N-terminal or C- 

10 terminal location or internally, but in any event the tag 

protein should not substantially interfere with the 
ability of the protein of interest to bind a specific 
binding partner which is used in the affinity enrichment 
protocol described herein. 

15 In the above embodiment a sequence that encodes 

a tag ligand peptide is engineered into a sequence 
encoding a phage coat protein. A suitable location for 
the ligand may be, for example, the N-terminal region of 
a coat protein, particularly that of a filamentous phage 

20 such as fd, fl, M13, etc. The coat protein is chosen for 

its ability to incorporate a tag ligand at or near the N- 
or C-terminal and yet be appropriately assembled into the 
coat of a phage particle. 

A preferred example of this embodiment is the 

25 placement of a ligand peptide in the N-terminus region of 

the minor coat protein pill of bacteriophage fd. Before 
incorporation into the phage, pill resides in the inner 
membrane of the host cell with its N-terminus protruding 
into an extracytoplasmic compartment, probably the 

30 periplasm. In this configuration the tag ligand peptide 

in the N-terminus of pill is available for binding to the 
tag protein fused to the protein of interest. This 
complex is then incorporated into the mature phage 
particle as it exits the cell and the C-terminus embeds 

35 in the coat of the phage. 

Under conditions appropriate for expression of 
the library-encoded protein, tag protein and ligand-coat 
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protein, the protein of interest is expressed and 
transported to the periplasmic space where conditions are 
conducive for proper processing, folding and assembly (as 
is the case for antibody chains) . Because at least one 
5 chain of the assembled protein is fused to a tag protein, 

the assembled protein of interest binds to the phage 
particle via the tag ligand as phage are extruded from 
the cell. The phage bearing the active protein of 
interest are then isolated by the affinity enrichment 

10 method described below and the DNA coding for the protein 

of interest is thereby obtained. 

In yet another embodiment, the desired protein 
is also transported to an extra-cytoplasmic compartment 
of the host cell, such as the bacterial periplasm, but as 

15 a fusion protein with a viral coat protein. In this 

embodiment the desired protein (or one of its polypeptide 
chains if it is a multi-chain protein, such as an 
antibody) is expressed fused to a viral coat protein 
which is processed and transported to the cell inner 

20 membrane. Other chains, if present, are expressed with 

a secretion leader and thus are also transported to the 
periplasm or other intracellular but extra-cytoplasmic 
location. The chains (e.g., light and heavy chains) 
present in the extra-cytoplasm then assemble into a 

25 complete protein (or binding fragment thereof) . The 

assembled molecules become incorporated into the phage by 
virtue of their attachment to the phage coat protein as 
the phage extrude through the host membrane and the coat 
proteins assemble around the phage DNA. The phage 

3 0 bearing the antibody complex may then be screened by 

affinity enrichment. 

In this embodiment the synthesis and 
amplification of cDNAs is prepared as described above, 
and then is cloned into or near a vector sequence 

35 encoding a coat protein, where the vector is, or is 

derived from, a filamentous phage, such as fl, fd, Pfl, 
M13, etc. In a preferred embodiment the filamentous 
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phage is fd-tet. The phage vector is chosen to contain a 
cloning site located in the 5 1 region of a gene encoding 
a phage coat protein , such as, for example, the pill coat 
protein. An appropriate vector (e.g., fd-tet Bl which is 
5 described below) allows oriented cloning of foreign 

sequences so that they are expressed at or near the N- 
terminus of the mature coat protein. 

A library is constructed by cloning the cDNA 
(e.g., the V H region) from the donor cells into a coat 

10 protein gene (e.g., gene III, "gill") cloning site. The 

cloned sequences of, for example, the V H domains are 
ultimately expressed as polypeptides or proteins (of up 
to about 120 amino acids in the case of the V R protein) 
fused to the N-terminus of the mature coat protein on the 

15 outer, accessible surface of the assembled phage 

particles. Although a large peptide fragment near the N- 
terminus of the coat protein may cause a decrease in the 
phage infectivity and/or yield compared to phage with 
much smaller, similarly-placed fragments, the larger 

20 fragments may still be effectively enriched by the 

procedures described herein. 

Some peptides, because of their size and/or 
sequence, may cause severe defects in the infectivity of 
their carrier phage. This causes a loss of phage from 

25 the population during reinfection and amplification 

following each cycle of panning. To minimize problems 
associated with defective infectivity, DNA prepared from 
the eluted phage is transformed into host cells by 
electroporation or well known chemical means. The cells 

3 0 are cultivated for a period of time sufficient for marker 

expression, and selection is applied as typically done 
for DNA transformation. The colonies are amplified, and 
phage harvested as described below for a subsequent 
round (s) of panning. 

35 When the desired protein is a multi-chain 

protein, such as an antibody or binding fragment thereof, 
the cDNA encoding the chain (s) not cloned into a phage 
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coat protein may be cloned directly into an appropriate 
site (as described below) of the vector containing the 
first chain-coat protein library; or, preferably, the 
subsequent chain (s) may be cloned as a separate library 
5 in a different plasmid vector, amplified, and 

subsequently the fragments installed in the first chain- 
coat protein library vector. For example, when the first 
chain is an antibody heavy chain or binding fragment 
thereof, the ultimate destination of light chain V L cDNA 

10 sequence is in a vector phage RF DNA that already 

contains a V R sequence in a coat protein gene, thus 
randomly recombining V H and V L sequences in a single 
phage genome. 

The second or subsequent chain of the desired 

15 multi-chain protein, such as V L , is cloned so that it is 

expressed with a signal peptide leader sequence that will 
direct its secretion into the periplasm of the host cell. 
For example, several leader sequences have been shown to 
direct the secretion of antibody sequences in E. coli, 

20 such as OmpA (Hsiung, et al., Biotechnology 4:991-995 

(1986)), pelB (Better, et al., Science 240:1041-1043 
(1988)), phoA (Skerra and Pluckthun, Science 240:1038- 
1043 (1988)), and 6-lactamase (Zemel-Dreasen and Zamir, 
Gem 7:315-322 (1984)). 

25 The cloning site for the subsequent chain 

cDNA's should be placed so that it does not substantially 
interfere with normal phage function. One such locus is 
the intergenic region as described by Zinder and Boeke, 
Gene 19:1-10 (1982). The V L sequence is preferably 

3 0 expressed at an equal or higher level than the V H /pIII 

product to maintain a sufficiently high V L concentration 
in the periplasm to provide efficient assembly 
(association) of V L with V R chains. 

Generally, the successful cloning strategy 

35 utilizing a phage coat protein, such as pill of 

filamentous phage fd, will provide: (1) expression of a 
protein chain (or a first polypeptide chain when the 
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desired protein is multi-chained, e.g., the V H chain) 
fused to the N-terminus of a full sized (or nearly full 
sized) coat protein (e.g., pill) and transport to the 
inner membrane of the host where the hydrophobic domain 
5 in the C-terminal region of the coat protein anchors the 

fusion protein in the membrane, with the N-terminus 
containing the chain protruding into the periplasmic 
space and available for interaction with a second or 
subsequent chain (e.g., V L to form an F v or Fab fragment) 

10 which is thus attached to the coat protein; (2) adequate 

expression of a second or subsequent polypeptide chain if 
present (e.g., V L ) and transport of this chain to the 
soluble compartment of the periplasm; and (3) will 
usually, but not necessarily, produce little or no 

15 interference with normal phage function or host cell 

viability. 

The number of possible combinations of heavy 
and light chains probably exceeds 10 12 . To sample as 
many combinations as possible depends, in part, on the 

20 ability to recover large numbers of transf ormants. For 

phage with plasmid-like forms (as filamentous phage) , 
electrotrans formation provides an efficiency comparable 
to that of phage A transf ect ion with in vitro packaging, 
in addition to a very high capacity for DNA input. This 

25 allows large amounts of vector DNA to be used to obtain 

very large numbers of transf ormants. The method 

described by Dower et al., Nucleic Acids Res. . 16:6127- 

6145 (1988) , incorporated by reference herein, may be 

used to transform fd-tet derived recombinants at the rate 
7 

3 0 of about 10 transf ormants/ /xg of lxgated vector into 

E. coli (such as strain MC1061) , and libraries may be 
constructed in fd-tet Bl of up to about 3 x 10 8 members 
or more. Increasing DNA input and making modifications 
to the cloning protocol within the ability of the skilled 

35 artisan may produce increases of greater than about 10- 

fold in the recovery of transf ormants, providing 
libraries of up to 10 10 or more recombinants. 
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The trans formants are selected by growth in an 
appropriate antibiotic (s) which, in the case of the fd- 
tet vector, is tetracycline. This may be done on solid 
or in liquid growth medium. For growth on solid medium, 
5 the cells are grown at a high density (~10 8 to 10 9 tfs 

per m 2 ) on a large surface of, for example, L-agar 
containing the selective antibiotic to form essentially a 
confluent lawn. The cells and extruded phage are scraped 
from the surface and phage are prepared for first round 

10 of panning essentially as described by Parmley and Smith, 

Gene 73:305-318 (1988), incorporated by reference herein. 
For growth in liquid culture, cells may be grown in L- 
broth and antibiotic through about 10 or more doublings. 
The phage are harvested by standard procedures (see 

15 Sambrook et al., (1989) Molecular Cloning , 2nd ed. 

(1989) , supra, for preparation of M13 phage) as further 
modified as described below. Growth in liquid culture 
may be more convenient because of the size of the 
libraries, while growth on solid media likely provides 

20 less chance of bias during the amplification process. 

For affinity enrichment of desired clones, 
about 10 3 to 10 4 library equivalents (a library 
equivalent is one of each recombinant — 10 4 equivalents 
of a library of 10 9 members is 10 9 x 10 4 = 10 13 phage) 

25 are incubated with hapten (ligand) to which the desired 

protein (e.g., antibody) is sought. The hapten is in one 
of several forms appropriate for affinity enrichment 
schemes. In one example the hapten is immobilized on a 
surface or particle, usually anchored by a tether of 

3 0 enough length (3 to 12 carbons, for example) to hold the 

hapten far enough away from the surface to permit free 
interaction with the antibody combining site. The 
library of phage bearing antibodies is then panned on the 
immobilized hapten generally according to the procedure 

35 described in the Example section below. 

A second example of hapten presentation is 
hapten attached to a recognizable ligand (again with a 
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tether of some length) . A specific example of such a 
ligand is biotin. The hapten, so modified, is incubated 
with the library of phage and binding occurs with both 
reactants in solution. The resulting complexes are then 
5 bound to streptavidin (or avidin) through the biotin 

moiety. The streptavidin may be immobilized on a surface 
such as a plastic plate or on particles, in which case 
the complexes (phage- ant ibody-hapten-biotin-streptavidin) 
are physically retained; or the streptavidin may be 

10 labelled, with a fluorophore, for example, to tag the 

active phage/ antibody for detection and/or isolation by 
sorting procedures, e.g., on a fluorescence - activated 
cell sorter. 

The phage bearing antibodies without the 

15 desired specificity are removed by washing. The degree 

and stringency of washing required will be determined for 
each protein of interest. A certain degree of control 
can be exerted over the binding characteristics of the 
antibodies recovered by adjusting the conditions of the 

20 binding incubation and the subsequent washing. The 

temperature, pH, ionic strength, divalent cations 
concentration, and the volume and duration of the washing 
will select for antibodies within particular ranges of 
affinity for the hapten. Selection based on slow 

25 dissociation rate, which is usually predictive of high 

affinity, is the most practical route. This may be done 
either by continued incubation in the presence of a 
saturating amount of free hapten, or by increasing the 
volume, number, and length of the washes. In each case, 

30 the rebinding of dissociated antibody-phage is prevented, 

and with increasing time, antibody-phage of higher and 
higher affinity are recovered. 

Additional modifications of the binding and 
washing procedures may be applied to find antibodies with 

35 special characteristics. The affinities of some 

antibodies are dependent on ionic strength or cation 
concentration. This is a useful characteristic for 
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antibodies to be used in affinity purification of various 
proteins when gentle conditions for removing the protein 
from the antibody are required. Specific examples are 
antibodies which depend on Ca ++ for binding activity and 
5 which released their haptens in the presence of EGTA. 

See Hopp et al., Biotechnology 6:1204-1210 (1988). Such 
antibodies may be identified in the recombinant antibody 
library by a double screening technique isolating first 
those that bind hapten in the presence of Ca ++ , and by 

10 subsequently identifying those in this group that fail to 

bind in the presence of EGTA. 

Antibodies with certain catalytic activities 
may be enriched in groups of antibodies with high 
affinity for reactants (substrates and intermediates) but 

15 low affinity for products. A double screen to enrich for 

antibodies with these characteristics may be useful in 
finding antibodies to catalyze certain reactions. 
Further, catalytic antibodies capable of certain cleavage 
reactions may also be selected. One category of such 

2 0 reactions is the cleavage of a specific end group from a 

molecule. For example , a catalytic antibody to cleave a 
specific amino acid from an end of a peptide may be 
selected by immobilizing the peptide and panning the 
antibody library under conditions expected to promote 
25 binding but not cleavage (e.g., low temperature, 

particular ionic strength, pH, cation concentration, 
etc., depending on the nature of the end group and the 
cleavage reaction) and followed by a wash. This allows 
antibodies that recognize the end group to bind and 

3 0 become immobilized, and from this group will come those 

capable of cleavage. To find those capable of cleavage, 
the conditions are shifted to those favorable for 
cleavage. This step will release those antibody-phage 
capable of cleaving themselves free of the immobilized 
35 peptide. 

An alternative way to accomplish this is to pan 
for antibodies that bind to the specific end group by 
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attaching that end group to a bond different from that to 
be cleaved (a non-peptide bond, for example) . By 
subsequent panning (of the positive phage from the first 
screen) on the end group attached via the proper bond 
5 under cleavage conditions , the non-binding fraction will 

be enriched for those with the desired catalytic 
activity. 

To elute the active antibody-phage from the 
immobilized hapten, after washing at the appropriate 

10 stringency, the bound (active) phage are generally 

recovered by eluting with pH shift. For example, pH2 or 
pHll may be used, which is then neutralized and the 
eluted phage are amplified by infecting or transforming 
the host cells. Examples of such hosts are E. coli , 

15 MC1061-F , KAN or K91. The cells are then grown as 

tetracycline resistant colonies. The colonies are 
scraped up and the extruded phage are purified by 
standard procedures as before. These phage are then used 
in another round of affinity enrichment (panning) , and 

20 this cycle is repeated until the desired level of 

enrichment is reached or until the target phage are no 
longer enriched relative to the background phage. 
Repeated rounds of panning and intervening amplifications 
can provide levels of enrichment exceeding 10 -fold. To 

25 isolate individual clones, phage from the final round of 

panning and elution are infected into cells or their DNA 
is transformed into cells and grown on agar (usually L- 
agar) and antibiotics (usually tet) to form well 
separated individual colonies, each of which is a clone 

30 carrying phage genomes with both V H and V L sequences. 

The single stranded DNA from phage particles extruded 
from each colony may be isolated and DNA coding for the 
V H and V L fragments sequenced. The replicative form of 
the phage DNA (double stranded) may be isolated by 

35 standard means and the DNA in the cloning sites (V H and 

V L sequences) recloned into a vector designed for gene 
product expression in prokaryotes or eukaryotes to obtain 
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larger amounts of the particular antibodies selected in 
the screening process. 

Phage identified as having an antibody 
recognized by the target ligand are propagated as 
appropriate for the particular phage vector used. For 
fd-tet this is done in a liquid culture of rich medium 
(L-broth, for example) with antibiotic (Tet) selection. 
The phage are harvested and DNA prepared and sequenced by 
standard methods to determine the DNA and amino acid 
sequence of the particular antibody. 

The DNA may be recloned in a suitable 
eukaryotic or prokaryotic expression vector and 
transfected into an appropriate host for production of 
large amounts of protein. Antibody is purified from the 
expression system using standard procedures. The binding 
affinity of the antibody is confirmed by well known 
immunoassays with the target antigen or catalytic 
activity as described in Harlow and Lane, Antibodies, A 
Laboratory Manual , Cold Spring Harbor, N.Y. (1988) , 
incorporated herein by reference. 

The following example is offered by way of 
illustration, not by way of limitation. 

EXAMPLE I 

This Example describes a procedure for 
isolating an antibody which binds to a preselected 
antigen, and thereby isolating the nucleotide sequences 
which encode the antibody. An immunoglobulin expression 
library is prepared from mouse spleen cells using a 
filamentous phage, fd. The antibody heavy chains are 
expressed as fusion proteins with coat protein pill. 
Phage particles which contain the antibody of the desired 
binding specificity are isolated by means of a panning 
procedure using the preselected antigen. 
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Construction of Vectors 

A filamentous bacteriophage vector, fdTetBl, 
was constructed from the tetracycline resistance 
transducing vector fdTet (Zacher et al., 1980). The 
5 cloning site in fdTetBl was engineered into the N- 

terminal region of gene III, and comprises two non- 
complementary BstXI sites separated by 18 bases. This 
was accomplished by first removing a BstXI restriction 
site that was already present in the TN10 region of 

10 fdTet. RF DNA was digested with BstXI restriction 

endonuclease, followed by the addition of T4 polymerase 
to remove the protruding 3 ■ termini . Blunt ended 
molecules were then ligated and el ectrotrans formed into 
MC1061. RF DNA was isolated from several tetracycline 

15 resistant trans formants and was digested with BstXI 

restriction endonuclease. A clone which was not digested 
with this enzyme was selected for insertion of the 
cloning site by site directed mutagenesis (Kunkel et al., 
Meth. Enzvmol .. 154:367-382 (1987)) with the 

20 ol igonucleotide 5 1 TATGAGGTTTTGCCAGACAACTGGAACAGTTTCAGCGGA 

GTGCC AGTAGAATGGAAC AACTAAAGG- 3 1 . Insertion of the correct 
mutagenic sequence was confirmed by dideoxy sequencing of 
RF DNA that was isolated from several tetracycline 
resistant transf ormants. 

25 

Construc tion of Degenerate Oligonucleotide Library 

Vector fdTetBSN was constructed to contain a 
pair of non-complementary Sfil sites in the TN10 region. 
This site is the ultimate destination of the expression 
30 cassette from the light chain library. fdTetBl was 

opened at the unique Hindlll site. Two synthetic, 
complementary oligonucleotides were kinased and annealed 
to form the structure 

AGCTGGCCGCAGCGGCCGCGGCCGCGGCCGGTCCGGCC 
3 5 CCGGCGTCGCCGGCGCCGGCGCCGGCCAGGCCGGTCGA 

This oligo was ligated to the Hindlll site of fdTetBl by 

standard methods. A properly ligated and circularized 

molecule should contain no Hindlll sites, so after 
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inactivating the ligase, the reaction was recut with 
Hindlll to linearize those plasmids that had not taken up 
an insert. This material was then transformed into E. 
coli and selected on tetracycline. The structure of the 
5 resulting molecules was verified and these were 

designated fdTetBSN. 

To construct the vector fdTetSXNS, fdTetBSN was 
opened at the BstXI sites and an oligonucleotide, 
annealed to give the following structure 

10 GCC TCG AGA GCA CGA CGT ACT AGT GCT TGT 

A GAT CGG AGC TCT CGT GCT GCA TGA TCA CG 

was ligated to the BstXI sites of fdBSN to provide unique 

Xhol and Spel sites to receive the heavy chain fragments. 

The plasmid vector pVL was constructed by 

15 digesting pUC19 with AlwNI and polishing the ends to 

bluntness. An Sfil site was introduced by ligating the 

annealed oligonucleotides 

GGCCGCAGCGGCC 

CCGGCGTCGCCGG 

20 to the blunted AlwNI site. 

The Aatll site of this modified pUC19 was 
opened and a fragment containing the light chain 
expression cassette described by Huse et al., flanked on 
the downstream side (with respect to transcription) by an 

25 Sfil site of the sequence GGCCGGTCCGGCC . This plasmid 

now contains (in clockwise direction) Sfil site, B- 
lactamase gene, a lac Z promoter, a ribosome binding 
site, a pel B signal peptide sequence, SacI site, Xbal 
site, translation termination site(s) , and Sfil site. 

30 

Construction of the separate he avy and light chain 
libraries 

cDNA sequences representing the antigen binding 
domains of the heavy and light antibody chains are 
35 synthesized from the RNA of antibody producing cells in 

the manner described by Huse et al., supra. Spleen cells 
are used from mice that have been immunized with the 
preselected antigen to which the desired antibody binds. 
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The cDNA is amplified by PCR using primers annealing to 
the regions common to many antibody sequences that flank 
the variable antigen binding domains. The primers also 
contain appropriate restriction site sequences as 
5 described below. The amplified, double-stranded 

fragments are digested with the appropriate restriction 
nucleases and ligated to the compatible sites in the 
respective vectors. For the heavy chain fragments this 
is accomplished by the incorporation of Xhol sites into 

10 the 5* -PCR primers and Spel sites into the 3 1 -primers. 

These sites are then exposed by digestion and ligated to 
the corresponding sites in the digested fdTetSXNS vector. 
The ligation products were transformed by electroporation 
into E. coli MC1061 and, after outgrowth, selected on 

15 tetr acy cl ine . 

For the light chain library, the light chain 
fragments are amplified with 5 '-PCR primers containing 
SacI sites and 3 "-primers containing Xbal sites. The 
sites on the fragments are opened and ligated to the 

20 corresponding sites in pVL. The ligation products are 

then transformed by electroporation into E. coli MC1061. 
After outgrowth, transformants were recovered by 
selection on ampicillin. 



25 Construction of the combined heavy-light chain expression 

library 

DNA from the light chain library (in pVL) was 
digested with Sfil. The larger of the two resulting 
fragments, which contains the light chain expression 

30 cassette, is isolated and ligated in several-fold molar 

excess to DNA of the heavy chain library (in fdTetSXNS) 
that has been digested with Sfil. The ligation products 
are transformed by electroporation into E. coli MC1061 
cells. After a period of outgrowth, the transformants 

35 are double-selected on ampicillin (100 ug/ml) and 

tetracycline (20 ug/ml) . The resulting transformants 
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constitute combined library of heavy and light chain 
fragments in fdTetSXNS. 

The methods of cDNA synthesis, restriction 
digestions, fragment phosphorylation, and ligation are 
5 essentially as prescribed in Sambrook et al., supra. In 

all cases, DNA to be transformed is ethanol precipitated 
in the presence of 0.3M sodium acetate and resuspended in 
water. Electrotrans formations are done as described by 
Dower et al . , supra . After 1 hour of non-selective 

10 outgrowth at 37 *C in SOC medium (2% Bacto tryptone, 0.5% 

Bacto yeast extract, lOmM NaCl, 2.5mM KC1, lOmM MgC12, 
lOmM MgS04, 20mM glucose) and an aliquot is removed and 
several dilutions are plated on LB plates containing the 
appropriate antibiotic (20 ug/ml tetracycline, 100 ug/ml 

15 ampicillin, or both) . The remainder of the 

transformation is used to inoculate one liter of L-broth 
containing the appropriate antibiotic and is grown for 
various times to amplify the library. Phage are isolated 
from this culture and stored at 4 # C. 

20 

Isolation of Phage 

Purified phage from liquid cultures are 
obtained by clearing the supernatant two times by 
centrifugation, and precipitating phage particles with 

25 polyethylene glycol (final concentration 3.3% 

polyethylene glycol 8000, 0.4M NaCl). Following 
centrifugation, phage pellets are resuspended in TBS (50 
mM Tris-HCl, pH 7.5, 150mM NaCl) and stored at 4* C. 
Phage are also isolated from plate stocks in this manner, 

30 after scraping colonies from the agar surface and 

resuspending in L-broth. 

Affinity Purification 

Approximately 10 3 - 10 4 library equivalents of 
35 phage are reacted overnight with 1 /xg purified antibody 

at 4° C. The mixture is panned by a procedure as 
follows. A 60 x 15 mm polystyrene petri plate is coated 
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with 1 ml of streptavidin solution (1 mg/ml in 0.1 M 
NaHC0 3/ pH 8.6, 0.02% NaN 3 ) and is incubated overnight at 
4° C. The following day the streptavidin solution is 
removed. The plate is filled with 10 ml blocking * 
5 solution (30 mg/ml BSA, 3 Mg/ml streptavidin in 0.1 M 

NaHC0 3 , pH 9.2, 0.02% NaN 3 ) and incubated for 2 hours at 5 
room temperature. Two micrograms of biotinylated goat 
anti-mouse IgG (BRL) are added to the antibody-reacted 
phage library and incubated for 2 hours at 4°C. 

10 Immediately before panning , blocking solution is removed 

from streptavidin coated plate, and the plate is washed 3 
times with TBS/0.05% Tween 20. The antibody- reacted 
phage library is then added to the plate and incubated 
for 30 minutes at room temperature. Streptavidin coated 

15 agarose beads (BRL) may also be used for this affinity 

purification. The phage solution is removed and the 
plate is washed ten times with TBS/0.05% Tween 20 over a 
period of 60 minutes. Bound phage are removed by adding 
elution buffer (1 mg/ml BSA, 0.1 N HCl, pH adjusted to 

20 2.2 with glycine) to the petri plate and incubating for 

10 minutes to dissociate the immune complexes. The 
eluate is removed, neutralized with 2M Tris (pH 
unadjusted) and used to infect log phase F 1 -containing 
bacterial cells. These cells are then plated on LB agar 

25 plates containing tetracycline (20 /xg/ml) , and grown 

overnight at 37 °C. Phage are isolated from these plates 
as described and the affinity purification process was 
repeated for two to three rounds. After the final round 
of purification, a portion of the eluate is used to 

30 infect cells and plated at low density on LB tetracycline 

plates. Individual colonies are transferred to culture 
tubes containing 2 ml LB tetracycline and grown to ^ 
saturation. Phage DNA is isolated using a method 
designed for the Beckman Biomek Workstation (Mardis and * 

35 Roe., Biotechniaues . 7:840-850 (1989)) which employs 96- 

well microtiter plates. Single stranded DNA is sequenced 
by the dideoxy method using Sequenase (U.S. Biochemicals) 
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and an oligonucleotide sequencing primer (5 1 - 
CGATCTAAAGTTTTGTCGTCT-3 1 ) which is complementary to the 
sequence located 40 nucleotides 3 1 of the second BstXI 
site in fdTetBl. 

5 It is evident from the above that compositions 

and methods are provided which substantially increase the 
ability to isolate nucleotide sequences which encode 
proteins of interest, particularly antibodies, from a 
large DNA library. This is especially encouraging, in 

10 that these methods and compositions may be employed to 

recover or produce de novo many proteins, and 
particularly monoclonal antibodies, useful as therapeutic 
or prophylactic compositions, diagnostic reagents, 
catalytic compounds, etc. previously obtainable only by 

15 e: tensive experimentation, if at all. The invention 

herein provides a means to circumvent many of the 
difficulties associated with traditional methods of 
monoclonal antibody technology, and particularly human 
monoclonal antibody technology. Further, the proteins 

2 0 identified by the present invention are produced by 

recombinant means, providing additional advantages of, 
inter alia , convenience, substantial purity and 
economics. 

Although the foregoing invention has been 
25 described in some detail by way of illustration and 

example for purposes of clarity of understanding, it will 
be obvious that certain changes and modifications may be 
practiced within the scope of the appended claims. 
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WHAT IS CL AIMED IS: 

1. A method for screening a DNA library for a 
nucleotide sequence which encodes a protein of interest, 
5 comprising: 

transforming a host cell with a bacteriophage 
expression vector which comprises (i) a DNA library 
member joined to a first nucleotide sequence encoding a 
tag protein, and (ii) a second nucleotide sequence 
10 encoding a tag ligand protein, which second sequence is 

joined to a third nucleotide sequence encoding a coat 
protein of the bacteriophage; 

cultivating the transformed cell under 
conditions suitable for expression and assembly of 
15 bacteriophage particles and the protein of interest; and 

selecting bacteriophage particles encoding the 
protein of interest by means of a ligand specific 
therefor. 

2 0 2. The method according to claim 1, wherein 

the bacteriophage vector is lambda. 

3. The method according to claim 2, wherein 
the lambda coat protein is gpD or gpE. 

25 

4. The method according to claim 1, wherein 
the protein that specifically binds the tag protein and 
the bacteriophage coat protein are expressed as a fusion 
protein. 

30 

5. The method according to claim 4, wherein 
the protein that specifically binds the tag protein is on 
an outer surface of the bacteriophage coat protein. 



35 



6. The method according to claim 1, wherein 
the tag protein and the protein of interest are expressed 
as a fusion protein. 
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7. The method according to claim 6, wherein 
the protein of interest comprises an antibody chain or a 
binding fragment thereof. 

5 

8. The method according to claim 7, wherein 
the fusion protein comprises the tag protein and an 
antibody heavy chain or variable region thereof. 

10 9. The method according to claim 8, wherein 

the tag protein is fused to the C-terminus of the heavy 
chain or variable region thereof. 

10. The method according to claim 7, wherein 
15 the tag protein is fused to the C-terminus of an antibody 

light chain or variable region thereof. 

11. The method according to claim 1, wherein 
the bacteriophage are harvested from the host cell 

20 culture before the selecting step. 

12. The method according to claim 1, wherein 
expression of the DNA library sequence members is 
inducible. 
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13. The method according to claim 12, wherein 
the induction of expression of the DNA library sequences 
is delayed until assembly of some bacteriophage particles 
has occurred. 

14. The method according to claim 1, wherein 
the transformed host cells are lysed after cultivation 
and the bacteriophage particles are selected from 
cellular debris. 

15 . The method according to claim 14 , wherein 
the bacteriophage particles encoding the protein of 
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interest are enriched by repeating the selection step at 
least once. 

16. The method according to claim 14, wherein 
an excess of tag protein is added to the culture of 
transformed host cells before lysis. 

17. The method according to claim 1, wherein 
the bacteriophage vector is a filamentous bacteriophage. 

18. The method of claim 17 , wherein the second 
nucleotide sequence encoding the tag ligand protein is 
cloned into the 5 » region of a nucleotide sequence 
encoding the bacteriophage coat protein. 

19. The method of claim 18, wherein the 
protein of interest comprises an antibody chain or a 
binding fragment thereof. 

20 20. The method according to claim 19, wherein 

the tag protein is fused to the C-terminus of an antibody 
heavy chain or variable region thereof. 

21. The method according to claim 20, wherein 
25 the bacteriophage vector further comprises a second DNA 

library member encoding an antibody light chain or a 
variable region thereof. 

22. The method according to claim 21, wherein 
3 0 the second DNA library member encoding the antibody light 

chain or variable region thereof is expressed with a 
signal peptide leader which directs secretion of the 
light chain into the periplasm of the transformed host 
cell. 



35 



23. The method of claim 1 further comprising 
the step of isolating the nucleotide sequences which 
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encode the protein of interest from the selected 
bacteriophage particles. 

24. A method for isolating a nucleotide 

5 sequence which encodes a protein of interest , comprising: 

transforming a host cell with a bacteriophage 
expression vector which comprises a DNA library member 
joined to a nucleotide sequence encoding a coat protein 
of the bacteriophage, wherein a coat protein-library 
10 protein fusion product is encoded; 

cultivating the transformed cell under 
conditions suitable for expression and assembly of 
bacteriophage particles; and 

selecting bacteriophage particles which encode 
15 the protein of interest. 

25. The method according to claim 24 , further 
comprising the step of isolating the nucleotide sequences 
which encode the protein of interest from the 

2 0 bacteriophage which have been selected. 

26. The method according to claim 24, wherein 
the bacteriophage is a filamentous phage. 

25 27. The method according to claim 26, wherein 

the filamentous bacteriophage is fl # fd, or M13. 

28. The method according to claim 2 6 , wherein 
the bacteriophage is fd or a derivative thereof. 

30 

29. The method according to claim 28, wherein 
the coat protein is pill. 

30. The method according to claim 26, wherein 
35 the DNA library member is inserted in the 5» region of 

the nucleotide sequence which encodes the bacteriophage 
coat protein. 
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31. The method according to claim 30, wherein 
the protein encoded by the DNA library member is present 
on an outer surface of assembled bacteriophage particles. 

5 

32. The method according to claim 24 , wherein 
the protein of interest comprises an antibody chain or a 
binding fragment thereof. 

10 33. The method according to claim 24, wherein 

the DNA library member comprises amplified cDNA encoding 
an antibody heavy chain or variable region thereof. 

34. The method according to claim 33, wherein 
15 the antibody heavy chain or fragment thereof encoded by 

the DNA library member is fused at a C-terminus to the 
bacteriophage coat protein. 

35. The method according to claim 33, wherein 
20 the bacteriophage vector further comprises a second DNA 

library member encoding an antibody light chain or a 
variable region thereof. 

36. The method according to claim 35, wherein 
25 the second DNA library member encoding an antibody light 

chain or variable region thereof is obtained from 
amplified cDNA. 

37. The method of claim 35, wherein the second 
30 DNA library member encoding an antibody light chain or 

variable region thereof is expressed with a signal 
peptide leader which directs secretion of the light chain 
into the periplasm of the transformed host cell. 
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38. The method of claim 37, wherein the leader 
is selected from the group consisting of Omp A, pel B, 
phoA, pill or 6-lactamase. 
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39. The method according to claim 24, wherein 
the bacteriophage are purified from the host cell culture 
before the selecting step. 

5 

40. The method according to claim 24, wherein 
expression of the DNA library sequences is inducible. 

41. The method according to claim 24, wherein 
10 the bacteriophage which encode the protein of interest 

are selected by means of a ligand specific for the 
protein of interest. 

42. The method of claim 41, wherein the 

15 protein of interest is an antibody and the ligand is an 

antigen specifically bound by said antibody. 

43. The method according to claim 24, wherein 
the bacteriophage particles encoding the protein of 

20 interest are enriched by repeating the selection step at 

least once. 

44. A composition comprising a protein of 
interest produced according to the method of claim 1, 17 

25 or 24. 

45. The composition of claim 44, wherein the 
protein of interest is an antibody molecule or binding 
fragment thereof. 
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46. The antibody of claim 45 wherein the 
antibody binds to a preselected antigen. 
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